Data is the raw material for making informed decisions, answering critical questions, and gaining a competitive advantage. A considerable challenge facing both commercial and nonprofit organizations today is the explosion of data. Before data can be used, however, considerable work needs to be done to prepare it for analysis: it has to be formatted, statistically summarized, graphically visualized, and documented.

Regardless of the industry vertical a customer belongs to, the modular data services listed below will help by turning the insights learned and patterns detected into important decisions.

Request a free consultation to start discussing your requirements and what you want to achieve, and I will be happy to come back to you with a proposal that includes my process, plan of action, and delivery timeline. If you are not satisfied with the services provided, you will be fully reimbursed.

Data Cleaning and Preparation

Description:

Quality data is the most important aspect of data analytics. Starting from raw data and arriving at a dataset that is relevant, accurate, and connected largely determines whether a customer succeeds or fails in answering a question or making a decision.

Key Capabilities:

  • Whether the data is structured or unstructured, I will apply data cleaning best practices to clean, wrangle, and feature engineer your data to enhance its quality.
  • Pull data that exists in different formats, such as R, Excel, Minitab, Stata, SAS, SPSS, and plain text, and blend them together to generate a consistent dataset for smooth data exploration and analysis.
  • Use Application Programming Interfaces (APIs) to tap into data warehouses from the US government (e.g. US Census), international organizations (e.g. UN/WHO/WB), municipalities, national statistical associations (e.g. Federal Reserve Bank of St. Louis - Economic Research County), and commercial entities to build high quality datasets.
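
As a rough illustration of the blending step described above, here is a minimal Python sketch that joins a CSV source and a JSON source on a shared key after normalizing column names and trimming whitespace. It uses only the standard library. The function names (`normalize_key`, `clean_record`, `blend`), the column names, and the sample values are all hypothetical and invented for this example; real projects would typically involve more formats and more validation.

```python
import csv
import io
import json


def normalize_key(name):
    """Lower-case a column name and replace spaces with underscores."""
    return name.strip().lower().replace(" ", "_")


def clean_record(record):
    """Normalize column names and trim whitespace from string values."""
    cleaned = {}
    for key, value in record.items():
        if isinstance(value, str):
            value = value.strip()
        cleaned[normalize_key(key)] = value
    return cleaned


def blend(csv_text, json_text, key):
    """Left-join CSV rows with JSON records that share the same key column."""
    csv_rows = [clean_record(r) for r in csv.DictReader(io.StringIO(csv_text))]
    json_rows = [clean_record(r) for r in json.loads(json_text)]
    # Compare keys as strings, because CSV values are always read as text.
    lookup = {str(row[key]): row for row in json_rows}
    blended = []
    for row in csv_rows:
        merged = dict(row)
        for name, value in lookup.get(str(row[key]), {}).items():
            if name != key:
                merged[name] = value
        blended.append(merged)
    return blended


# Hypothetical sample inputs, made up for illustration only.
CSV_TEXT = "County ID, Name \n1, Adams \n2, Baker\n"
JSON_TEXT = '[{"county id": 1, "Population": 1200}, {"county id": 2, "Population": 800}]'

rows = blend(CSV_TEXT, JSON_TEXT, key="county_id")
```

Each entry in `rows` is a dictionary with consistent lower-case column names, which makes later exploration and analysis simpler.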

Data-Driven Graphics and Geographic Maps

Description:

Visualizing data goes a long way toward helping customers understand their data. Applying statistical analysis to data and overlaying it on geographical maps of continents, countries, counties, and cities “can help you make meaningful comparisons among thousands of pieces of information, extracting patterns not easily found through other methods.”

Key Capabilities:

  • Generate publication-ready, data-driven statistical graphics to help visualize quantitative information.
  • Build an interactive web graphic that can be inserted in a document or a web page.
  • Overlay shape and GeoJSON files with Leaflet, a JavaScript-based package that converts static maps into information-rich interactive maps. Custom popup information displays can be added.
  • Capture, manipulate, analyze, and present raw data, and generate spatial analytic results in the context of cartographic maps.
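
To make the GeoJSON overlay idea above more concrete, the following Python sketch turns a few records into a GeoJSON FeatureCollection that a mapping library could load as a layer. It is only an illustration under stated assumptions: the record fields, the site names, the coordinates, and the `popup` property name are hypothetical. A map layer would need to be told to read that property when it builds its popups.

```python
import json


def to_feature(name, lon, lat, value):
    """Build one GeoJSON Point feature with a popup text property."""
    return {
        "type": "Feature",
        # GeoJSON positions are ordered [longitude, latitude].
        "geometry": {"type": "Point", "coordinates": [lon, lat]},
        # "popup" is a hypothetical property name chosen for this sketch.
        "properties": {"name": name, "value": value, "popup": f"{name}: {value}"},
    }


def to_feature_collection(records):
    """Wrap a list of record dictionaries in a GeoJSON FeatureCollection."""
    return {
        "type": "FeatureCollection",
        "features": [to_feature(**record) for record in records],
    }


# Hypothetical records; names, coordinates, and values are made up.
records = [
    {"name": "Site A", "lon": -90.2, "lat": 38.6, "value": 42},
    {"name": "Site B", "lon": -77.0, "lat": 38.9, "value": 17},
]

collection = to_feature_collection(records)
geojson_text = json.dumps(collection, indent=2)
```

The resulting `geojson_text` can be saved to a `.geojson` file and overlaid on an interactive map.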

Reproducible Dynamic Automated Reporting

Description:

A data-driven document is a very high quality document that includes interactive tables, images, statistical graphs, and geographic maps. The document is reproducible in that the graphs are recomputed from the data, meeting the standard set by scientific research. The report can automatically update itself when the underlying data for the maps change, keeping the report accurate to the present.

Key Capabilities:

  • Apply the latest tools to add interactive graphs, maps, and animated GIFs. The report will include the data processing so that it is fully reproducible: anyone can rerun it to regenerate the graphics and statistics.
  • Choose and apply themes to enhance the look and feel, add a table of contents, and embed data-driven graphics.
  • Generate slide presentations that include an embedded interactive dashboard.
  • Generate reports in PDF, Word, or HTML formats.

Natural Language Processing

Description:

Natural Language Processing (NLP) “is field of study that focuses on the interactions between human language and computers, and derive meaning from human language in a smart and useful way.” Organizations maintain most of their data in Word and PDF documents written in natural language, not in databases. It is one of the richest information sets most organizations possess, but they rarely tap into it. Gaining access to this information and drawing insight from it with analytical NLP packages will bring tremendous value and give you a competitive advantage.

Key Capabilities:

  • Key phrase extraction - Given a document, exploit the structure of the words in the document to determine the “central” key phrases and output them as a list. This is similar to how Google PageRank selects Web pages.
  • Sentiment analysis - Extract subjective information from a document to determine sentiment. It is especially useful for identifying trends.
  • Optical character recognition (OCR) ingest - Given an image representing printed text, ingest and tidy the text data and prepare it for analysis.
  • Text summarization
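
The PageRank comparison in the key phrase extraction item above can be sketched in a few lines of Python. The example below is a simplified, TextRank-style illustration and not necessarily the method used in any particular NLP package: it links words that appear near each other and then scores them with a PageRank-style iteration. It ranks single words rather than multi-word phrases, and the stop-word list, window size, damping factor, function names, and sample text are all assumptions made for this sketch.

```python
import re
from collections import defaultdict

# Small illustrative stop-word list; a real project would use a fuller one.
STOP_WORDS = {"a", "an", "and", "are", "as", "for", "in", "is", "it",
              "of", "on", "the", "to", "with"}


def tokenize(text):
    """Lower-case the text and keep alphabetic words that are not stop words."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]


def build_graph(words, window=2):
    """Link each word to the distinct words within `window` positions after it."""
    graph = defaultdict(set)
    for i, word in enumerate(words):
        for other in words[i + 1 : i + 1 + window]:
            if other != word:
                graph[word].add(other)
                graph[other].add(word)
    return graph


def rank_words(graph, damping=0.85, iterations=50):
    """Score words with a PageRank-style power iteration on the undirected graph."""
    scores = {node: 1.0 for node in graph}
    for _ in range(iterations):
        scores = {
            node: (1 - damping)
            + damping * sum(scores[n] / len(graph[n]) for n in graph[node])
            for node in graph
        }
    return scores


def key_words(text, top_n=3):
    """Return the `top_n` highest-scoring words, ties broken alphabetically."""
    scores = rank_words(build_graph(tokenize(text)))
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_n]


# Hypothetical sample text, made up for illustration only.
SAMPLE = ("Data cleaning improves data quality. Data analysis needs clean data. "
          "Data maps show data patterns.")

top = key_words(SAMPLE)
```

Words that co-occur with many other words end up with the highest scores, which is the sense in which they are “central” to the document.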

Interactive Dashboard

Description:

An intuitive, powerful web-based dashboard that lets you interactively explore, manipulate, monitor, and visualize your data.

Key Capabilities:

Contact:

Drop me an email at abiyu.giday@gmail.com, follow me on Twitter @abiyugiday, send me a friend request on LinkedIn, and don’t forget to check my blog for new updates at abiyug.github.io.